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<T> 1. INTRODUCTION 

Q 

This report presents recent works extending sieve methods, from their classical setting, to 
T—i new situations characterized by the targeting of sets with exponential growth, arising often 

from discrete groups like SL m (Z) or sufficiently big subgroups. 

A recent lecture of Sarnak [33] mentions some of the original motivation (related to the 
Markov equation and closed geodesies on the modular surface). The first general results con- 
cerning these sieve problems appeared around 2005 in preprint form, and Bourgain, Gamburd 
and Sarnak have written a basic paper presenting its particular features Other applica- 
tions, with a very different geometric flavor, also appeared independently around that time, 
first (somewhat implicitly) in some works of Rivin [52J. 

J> The most crucial feature in applying sieve to these new situations is their dependence 

on spectral gaps, either in a discrete setting (related to expander graphs or to Property (r) 

[- — , of Lubotzky [ID]) or in a geometric setting (generalizing for instance Selberg's result that 

Ai 3/16 for the spectrum of the Laplace operator on the classical hyperbolic modular 

CN surfaces). 

The outcome of these developments is that there now exist very general sieve inequalities 
involving, roughly speaking, discrete objects with exponential growth. Moreover, their ap- 
plicability (including to problems seemingly unrelated with classical analytic number theory, 
^ as we will show) has expanded enormously, as - partly motivated by these new applications 

of sieve methods - many new cases of spectral gaps have become available. Particularly im- 
pressive are the results on expansion in finite linear groups (due to many people, but starting 
from the breakthrough of Helfgott [26] for SL2), and those concerning applications of ergodic 
methods to lattices in semisimple groups with Property (r) (developed most generally by 
Gorodnik and Nevo |21|). 



Before going towards the heart of this report, we state here a particularly concrete and 
appealing result arising from sieve in expansion. We recall first that Cl(n) is the arithmetic 
function giving the number of prime factors, counted with multiplicity, of a non-zero integer 
n, extended so that $7(0) = +00. 
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Theorem 1.1. — Let A C SL m (Z) be a Zariski-dense subgroup, for instance the group L 
generated by the elements 



in the case m = 2 

Let f be an integral polynomial function on Z m , which is non-constant. Let xq E Z m — {0} 
be a fixed vector. There exists an integer r = r(f, xq, A) ^ 1 such that the set 



is Zariski-dense in SL m , and in particular is infinite. In fact, there exists such r for which 
Of(xo;r) is not thin, in the sense of Def. 3.1.1]. 

Part of the point, and it will be emphasized below, is that A may have infinite index 
in SL m (Z). In particular, for m = 2, this is the case for the group L generated by the 



Notation. We recall here some basic notation. 

- The letter p will always refer to prime numbers; for a prime p, we write F p for the finite 
field Z/pZ, and we write F q for a field with q elements. For a set X, \X\ is its cardinality, a 
non- negative integer or +oo. 

- The Landau and Vinogradov notation / = 0(g) and f g are synonymous, and f{x) = 
0(g(x)) for all x G D means that there exists an "implied" constant C ^ (which may 
be a function of other parameters, explicitly mentioned) such that |/(x)| ^ Cg(x) for all 
x G D. This definition differs from that of N. Bourbaki [TJ Chap. V] since the latter is of 
topological nature. On the other hand, the notation f(x) ~ g(x) and / = o(g) are used with 
the asymptotic meaning of loc. cit. 

Acknowledgments. Thanks are due to J. Bourgain, N. Dunfield, E. Fuchs, A. Gamburd, 
F. Jouve, A. Kontorovich, H. Oh, L. Pyber, P. Sarnak, D. Zywina and others for their help, 
remarks and corrections concerning this report. In particular, discussions with O. Marfaing 
during his preparation of a Master Thesis on this topic |43j were very helpful. 



2. MOTIVATION 

Sieve methods are concerned with multiplicative properties of sets of integers. Thus to 
expand the range of the sieve, one should describe new sets of integers to investigate. To 
present the spirit of this survey, we first give two examples of such sets of integers, which are 
rather unusual from a sieve perspective. One of them is a particularly appealing instance of 
"sieve in orbits", first considered in the distribution of curvatures of integral Apollonian 
circle packings. The second is even more surprising to look at: it has to do with the first 
homology of certain "random" 3-manifolds. Although we will say rather less about it later 
on, it presents some unusual features, and suggests interesting questions. 



(1) 




Q f (x ;r) = { 1 eA | J2(/( 7 • x )) < r} 



matrices Q. 
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2.1. Apollonian circle packings 

It is a very classical geometrical fact that, given three circles (Oi> 02,0s) hi the plane 
which are pairwise tangent to each other, and have disjoint "interiors" (the discs they bound), 
with radii (r\, r 2 , r 3 ) and curvatures (ci, c 2 , C3) = (r^ 1 , r^ 1 , r^ 1 ), one can find two more circles 
(say (04,01) with curvatures (04,04)), so that both 

(Oi, O2, 03, 04) and (Oi, O2, Oa, 01) 

are four pairwise tangent circles (with disjoint "interiors"). In fact, this applies also to negative 
radii or curvatures, where a negative radius is interpreted to mean that the "interior" of the 
circle should be the complement of the bounded disc (see Figure 1). Such 4-tuples are called 
Descartes configurations, since a result of Descartes states that the two sets of four curvatures 
satisfy the quadratic equations 

Q(ci, c 2 , c 3 , c 4 ) = Q(ci, c 2 , c 3 , c 4 ) = 

where 

Q(x, y, z, t) = 2{x 2 + y 2 + z 2 + t 2 )-(x + y + z + t) 2 . 

In particular, if (Oi, O2, 03, O4) are such that their curvatures are all integers, then we 
obtain an integral quadratic equation for c 4 where one solution (namely, C4) is an integer: 
thus c 4 is an integer also. Moreover, again given (01,02,03)04) with integral curvatures, 
there are also circles 

Oi,0' 2 ,03. 

for which, for instance, the circles 

(01,02,03,04) 

form a Descartes configuration, and as above, the curvatures d x , c' 2 , c' 3 are integers. In fact, 
these new curvatures are given - by solving the quadratic equation using the known root - as 

(c'l' C 2, c 3,C4) = (C1,C 2 ,C 3 ,C 4 ) • t Si, 
(Cl, C 2 , C3, C 4 ) = (Cl, C 2 , C3, C4) • *S 2 , 
(Cl, C 2 , C 3 , C4) = (C1,C 2 ,C3,C4) • *S3, 
(Cl, C 2 , C3, C 4 ) = (C1,C 2 ,C3,C4) • *S4, 

where the matrices s±, s 4 are in the group 0(Q,Z) of integral automorphisms of the 
quadratic form above, namely 

\ 

1 2 2 
1 

1/ 

and S3 and S4 are similar. Note that s? = 1 for all i, and one can in fact show that these are 
the only relations satisfied by those matrices. 

Each of the new sets of curvatures can be used to iterate the process; in other words, 
denoting by A the subgroup of 0(Q, Z) generated by the Si, the integers arising as coefficients 
of a vector in the orbit A ■ c of a "root quadruple" c = (c\, . . . , c 4 ), are all the curvatures of 
circles arising in this iterative circle packing. These are Apollonian circle packings, and the 
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Figure 1. Apollonian circle packing for c = (—6, 11, 14, 15); the labels are the curvatures. 

first step is described in Figure 1 in one particular case (where one notices the convention 
dealing with negative curvatures). 

The set 6(c) of these curvatures, considered with or without multiplicity, is our first example 
of integers to sieve for. It is clear from the outset that such an attempt will be deeply connected 
with the understanding of the group A. Moreover, an interest in the multiplicative properties 
of the elements of 6(c) will obviously depend on the properties of the reduction maps 

A -> A p = A (mod p) C 0(Q, Z/pZ), 

modulo primes, and in particular in the image of this reduction map. 

The following features of A illustrate a basic property that makes the question challenging: 

- The group A is "big" in some sense: it is Zariski-dense in 0(Q) (as a Q-algebraic group), 
so that, if one can only use polynomial constructions, A is indistinguishable from the big Lie 
group 0(Q,C), 

- However, A is "small" in some other sense: specifically, A has infinite index in 0(Q, Z). 
Stated differently, the quotient A\0(Q,TL) (a three-manifold) has infinite volume for its nat- 
ural measure, induced from Haar measure on 0(Q, R). 

Remark 2.1. — Arithmetic properties of Apollonian packings were first discussed in [23J, in- 
cluding some properties of 6(c); the use of sieve to study 6(c) was begun in [3J. 

2.2. Dunfield-Thurston random 3-manifolds 

Our second example of integers to sieve from is chosen partly as an illustration of the great 
versatility of sieve, and partly because it will lead to some interesting examples later. It is 
based on a paper of Dunfield and Thurston [T3] (which did not explicitly introduce sieve); 
related work is due to Maher [42J and the author [35j (where sieve is explicitly present). 

Let g ^ 2 be a fixed integer. Fix also a handlebody H g of genus g; it is a connected oriented 
compact 3-manifold with boundary, and this boundary surface T, g is a surface of genus g (com- 
pact connected and oriented); in other words, for g = 2, H g is a solid double "doughnut", and 
Yi g its 2-dimensional boundary. A very classical way (going back to Heegaard) of constructing 
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compact 3-manifolds (also connected, oriented, without boundary) is the following: take a 
homeomorphism of T, g , and consider the manifold 

Ma, = H g Hg 

obtained by gluing two copies of H g using the map <f> to identify points on their common 
boundary. 

As may seem intuitively reasonable, the manifold Ma, does not change if <j) is changed 
continuously; this means that Ma, really depends only on the mapping class of <f) in the mapping 
class group T g of S g (roughly, the group of "discrete" invariants of the homeomorphisms of 
the surface; these groups, as was recently discussed in this seminar [50], have many properties 
in common with the groups SL m (Z), or better with their quotients Sp 2g (Z)). 

The topic of |13j (which is partly inspired by the Cohen-Lenstra heuristic for ideal class 
groups of number fields) is the investigation of the statistic properties of the fundamental 
group it i (Ma,) when <p is taken as a "random" elements of T g (in a sense to be described 
precisely below), in particular the study of the abelianization Hi(Ma,, Z) of 7Ti(M^), motivated 
by the virtual Haken Conjecture (according to which every compact 3-manifold with infinite 
fundamental group should have a finite covering N such that the abelianization of tx\ (N) is 
infinite). 

Motivated by this, our second example of set of integers is, roughly, the set of the orders of 
torsion subgroups of H\(Ma„ Z), as (f> runs over T g . Or rather, since here multiplicity is very 
hard to control, one should think of this as the map 

T g -> |ffi(M , Z)| G {0, 1, 2, 3, . . .} U {+oo}. 

It may be fruitful to think about these integers from a sieve point of view because of the 
"local" information given by the homology with coefficients in F p for p prime, namely 

Hi(Ma,, Z) ® Z/pZ = H X (M^ Z/pZ), 

and the sieve-like description 

dimHi(MA,, Z) ® Q > 1 <=^ (For all primes p, dim z/pZ H X (M^ Z/pZ) > 1) 

of the manifolds with infinite first homology (this is because Hi(Ma,, Z) is a finitely-generated 
group). 

A certain similarity with the previous example arises here from the fact that, as shown by 
Dunfield and Thurston, there is a natural isomorphism 

(2) fli(Afy,Z)~V7(J,&,J> 

where V = Hi(T, g ,Z) ~ Z 2 ' J is the first homology of the surface E ff , J is the image in V of 
Hi(H g ,Z) ~ Z 9 , which is a (fixed!) Lagrangian subspace for the intersection pairing on V, 
and 4>* denotes the induced action of <p on V. Thus Hi(M^, Z) only depends on 4>*, which is 
an element of the discrete group Sp(V) ~ Sp 2s (Z) of symplectic maps on V, with respect to 
the intersection pairing. Moreover, the reduction modulo p is given by 

(3) i?i(M , Z/pZ) ~ V P /(J P , ^J p ) 

where V p = V/pV, J p = J/pJ, and therefore it depends only on the reduction modulo p of 
0*, an element of the finite group Sp(V/pV) ~ Sp 2g (Z/pZ). 



1028-06 



3. A QUICK SURVEY OF SIEVE 

In this section, we survey quickly some of the basic principles of sieve methods, and state 
one version of the fundamental result that evolved from V. Brun's first investigations. Our goal 
is to make the sieve literature, and its terminology and notation, accessible to non-experts. 
In particular, this section is essentially self-contained; only in the next one do we start to fit 
the examples above (and many others) in the sieve framework. 

3.1. Classical sieve 

The classical sieve methods arose from natural questions related to the way multiplicative 
constraints on positive integers (linked to restrictions on their prime factorization) can interact 
with additive properties. The best known among these questions, and a motivating one for 
V. Brun and many later arithmeticians, is whether there exist infinitely many prime numbers 
p such that p + 2 is also prime, but the versatility of sieve methods is quite astounding. For 
many illustrations, and for background information, we refer to the very complete modern 
treatment found in the recent book of J. Friedlander and H. Iwaniec (16] . 

In the usual setting, the basic problem of sieve theory is the following: given a sequence 
3~ = (a n )n>i of non-negative real numbers (usually with a finite support, which is supposed 
to be a parameter tending to infinity) and a (fixed) subset "? of the primes (e.g., all of them), 
one seeks to understand the sum 

S($,z) = ^ a n: where P{z) = JJp, 

(n,P(z))=l p<z 
which encodes the contribution to the total sum 

of the integers not divisible by the primes in 3 which are < z, and one wishes to do so 
using properties of the given sequence which are encoded in sieve axioms or sieve properties 
concerning the congruence sums 

(4) S d (T) = °- 

n=0 (mod d) 

The fundamental relation between these quantities is the well-known inclusion- exclusion 
formula^ 

S(?,z)= Kd)S d (?), 

d\P(z) 

and the basic philosophy is that, for many sequences of great arithmetic interest, the con- 
gruence sums above can be understood quite well. Indeed, the notion of "sieve of dimension 
k > 0" arises as corresponding to a sequence for which, intuitively, the "density" of the 
sequence over integers divisible by a prime number p (in T) is approximately ftp -1 , at least 
on average over p, i.e., we have 

(5) S d (?)=g(d)S(J) + r d (?), 



Where fi(d) is the Mobius function, for non-squarefree integers, and otherwise equal to (— 1) 
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where r^S") is considered as a "small" remainder and g is a multiplicative function of d ^ 1 
for which g(jp) satisfies 

(6) g{p) = 'l + o(p- 1 - s ) 

for some 6 > (or even weaker or averaged versions of this, such as 

(7) J^5(p)logp = n\ogx + 0{l) ; 

since we have 

logp 



£^ = log* + 0(l) 



P 

for x ^ 2, by the Prime Number Theorem, such an assumption is consistent with the heuristic 
suggested above). 

This dimension condition often means that the sum S(H, z) corresponds to the number of 
integers (in a finite sequence) which, modulo the primes p £ IP, must avoid n residue classes. 

Example 3.1. — A characteristic example is the sequence 9~/ = 3^f,x, associated with a fixed 
monic polynomial / G Z[T] of degree r ^ 1 and a (large) parameter X, defined as the 
multiplicity 

(8) a n = \{m^X | f(m)=n}\ 

of the representations of an integer as a value f(m) with m ^ X. In that case, if "P is the set 
of all primes and z ~ X x l 2r ', it follows that 

(9) S($, z) = \{m ^ X | f(m) has no prime factor < z}\, 
and in particular, if z ~ X 1 / 2 *", we get 

5(3", z) = ({primes S> X r//2 of the form f(m) with m ^ X}|, 

a function of much arithmetic interest. 

One immediately notices in this example that, to be effective, sieve methods have to handle 
estimates uniform in terms of the support of the sequence, and in terms of the parameter z 
determining the set of the primes in the sieve, as the latter will be a function of the former. 

The congruence sums modulo d ^ 1 squarefree are easy to understand here: we have 

s d (? f ,x)= £ i, 

d\f(m) 

and by splitting the sum over m into residue classes modulo d and denoting 

p f {d) = \{a e Z/dZ | f{a) = (mod d)}\ 
the number of roots of / modulo d, we find 

(10) s d (? f , x) = £ ^ i = eM x + o{\) 

aeZ/dZ msgX 
/(a)=0 (mod d) ™=a (mod d) 

for X ^ 2, where the implied constant depends on /. The Chinese Remainder Theorem 
shows that d i-> pf(d) is multiplicative, and then from the Chebotarev density theorem (in 
the general case, though a trivial argument suffices if f(T) = (T — a\) ■ ■ ■ (T — a r ) splits 
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completely over Z, which corresponds to the Hardy-Littlewood prime tuple conjecture ( 2 ) ), we 
know that 

Pf(j>) 



^mpi = K io g i og:c + o(i) ) 



V 

p^x 

where k = n(f) is the number of irreducible factors of / in Q[T], so that the sieve problem 
here is of dimension k. (E.g., for f(T) = T 2 + 1, we have k = 1; there is, on average over 
primes p, one square root of —1 in Z/pZ.) 

A number of very sophisticated combinatorial and number-theoretic analysis (starting with 
Brun) have led to the following basic sieve statement (see, e.g., |16} Th. 11.13], where more 
details are given): 

Theorem 3.2. - - With notation as above, for a sieve problem of dimension k > 0, there 
exists a real number /3(k) > such that 

(/( S ) + 0(log J D)- 1 / 6 ))5(J) (l-g( P )) + R(D)^S(3,P) 

p\P(z) 

^(F(s) + 0((logD)- 1 / 6 )S(^) J] (l-g(p)) + R(D) 

p\P(z) 

for z = D 1 ^ with s > (3(k), where F(s) > and f(s) > are certain functions of s ^ 0, 
depending on k, defined as solutions of explicit differential- difference equations, such that 

lim /(s) = lim F(s) = 1, 

s— >+oo s— >+oo 

and where 

R(D) = J2\r d (n- 

d<D 

In both upper and lower bounds, the implied constant depends only on n and on the constants 
in the asymptotic Q. 

The leading term in the upper and lower bounds has a clear intuitive meaning: one con- 
gruence condition leads to a proportion 1 — g(p) of the sum S^S") that "passes" the test of 
sieving by p, and multiple congruence conditions behave - up to a point - as if they were 
independent. In particular, as soon as the remainders in §5§ are small for fixed d, we have an 
asymptotic formula for sieving with a fixed set of primes, as the support of the sequence $ 
grows. 

Since g(p) is about np~ l on average, the Mertens formula leads to 

P \p 

for z = X l / S for any fixed s > 0. The following definition is therefore a natural expression of 
the fact that one needs R to be of smaller order of magnitude to obtain actual consequences 
from Theorem 13.21 



( - 2 ' Which played an important role in the recent results concerning small gaps between primes of Goldston, 
Pintz and Yildirim. 
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Definition 3.3 (Level of distribution). — Let (3~ n ) be sequences as above and D n > 0. The 
3~ n have level of distribution D n if and only if 

(11) Rn = \ r d(3n)\ < S(? n )(logD n )- B 

d<D„ 

for any B > and n ^ 2, the implied constant depending on B. 



Example 3.4- — In the context of Example 3.1 for / £ Z[T] of degree r, with k irreducible 
factors, we have 

r d {?f,x) < d £ 

for all <i ^ 1 squarefree and e > 0, the implied constant depending on e. Since S(3^f t x) X we 



see that the level of distribution is ^ -D for any D = X 1 ^ 5 for 5 > 0. Applying Theorem 3.2 
with 2 = D 1 ^, s large enough, we deduce that there exists r(f) ^ 1 such that there are 
infinitely many positive integers m such that f(m) has at most r(/) prime factors, counted 
with multiplicity (and in fact, there are S> X/(\ogX) K such integers m ^ X). 

3.2. The sieve as a local-global study 

The point of view just described concerning sieves is very efficient. However, we will use an 
essentially equivalent formal description which is more immediately natural in the applications 
we want to consider later. 

In this second viewpoint, we start with a set Y of objects of "global" (and often arithmetic) 
nature. To study them, we assume given maps 

Y^Y p 

for p prime which are analogues of (and often defined by) "reduction modulo p" . Often, 
Y parametrizes certain integers, and the Y p parametrize their residue classes modulo p. To 
emphasize this intuition, we write y (mod p) for the image in Y p of y G Y. We interpret 
these maps as giving local information on objects in Y, and we assume that Y p is a finite 
set. It is often the case that Y — > Y p is surjective, but sometimes it is convenient to allow for 
non-surjective reduction maps. 

Using such data, we can form sifted sets associated with a set 7 of primes, some z ^ 2, and 
some subsets f2 p C Y p , namely 

(12) § Z (Y; Q) = {y £Y \ y (mod p) £ % for all p € p < z) C Y. 

To "count" the elements in this sifted set, we consider quite generally that we have available 
a finite measure /i on F, and the problem we turn to is to estimate (asymptotically, or from 



above or below), the measure H(S Z (Y; fi)) of the sifted setFn We assume for simplicity that 
for given y 6 Y, the set of p £ 7 with y (mod p) £ Q p is finite (this holds in most applications). 

This question can be interpreted in the previous framework as follows: for any y E Y, we 
define 

n (y) = n p 

y (mod p)£$7p 



' 3 ' It is hoped that fi will not be mistaken with the Mobius function. 
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for y EY, with the convention that n(y) = if the product is infinite. This is a non-negative 
integer depending on y such that the "adjunction" property 

I n(y)) <=^ (y (mod p) £ Q p ) 

holds for all p £ 7 if n{y) 1. 

Although the case n(y) = is usually exceptional. ( 4 ) it may occur. In order to take it into 
account, we define 

Y° = {y E Y | n(y) = 0}, Y + =Y — Y°. 



We define the sequence 3~ = (a n ) n ^i b} ( 5 ) 
(13) a, 

It follows that 



= fi({y E Y | n(y) = n}). 



and 



S(?,z)= J2 *n = v({y£Y + I (n(y),P(z)) = l}) = f i(§ z (Y+;Q)). 

(n,P(z))=l 



Example 3.5. — Example 3.1 may also be interpreted in this manner: we take Y to be the 
set of positive integers, \i to be the counting measure restricted to integers 1 ^ m ^ X in Y, 
the reduction maps to be Y — > Z/pZ, which are indeed surjective maps onto finite sets. With 



{a E Z/pZ | /(a) = 0} 



the set of zeros of / modulo p, it is clear that 

n{S z (Y;n)) = S(?,z) 

is the quantity given in |9) 



(6) 



Similarly, the congruence sums <Sd(3~) are now given by 

S d (J) = fi({y E Y + | y (mod p) E Q p for all p | d}) 
for d squarefree. This is also the measure of the set 

p\d p\d 

under the image measure of [x by the map of simultaneous reduction modulo p \ d on Y + . 

It is therefore very natural to take the following point of view towards the dimension 
condition first, we expect that for p prime - provided the measure [i counts "a large 
part" of Y -, we can write 

Kiv e Y I y ( mod p) = Q l) ~ K Y ) u p( a ) 



In counting questions below, it will have a negligible contribution. 
We assume of course that all sets {y | n(y) — a} are measurable. 

The set Y + is here the set of m ^ X such that f(m) 7^ 0; in particular, Y° is a finite set. 
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for all a, where v p is some natural probability measure on the finite set Y p ; secondly, we expect 
that for d squarefree, the reductions modulo the primes p dividing d are (approximately) 
independent, so that 

(14) fi({y E Y | y (mod p) = a p for all p \ d}) « fj,(Y) v p {a p ). 

P \d 

If we compare this with ^ , this corresponds to taking 

g(p) = v p (Clp), g{d) = Y[v p (Q p ), 

P \d 

and fits well with the intuitive meaning of g(d) as encoding the density of the sequence 
restricted to n divisible by d. Most crucially, we note that assuming that g is multiplicative 
is essentially equivalent with the expected asymptotic independence property (14). 

We now proceed to make approximations ( 14 ) precise, and interpret the level of distribution 
condition. This makes most sense in an asymptotic setting, and we therefore assume that we 
have we have a sequence (/J, n ) of finite measures on Y (corresponding to taking X — > +oo in 
Example 3.1 ). 

For any fixed squarefree number p, we consider the image of the associated probability 
measures 

jj-n = ^n/^n{Y), 

on the finite set 

Y d = l[Y d , 

P \d 

which are probability measures fl nid on Y d . 

Definition 3.6 (Basic requirements). — Sieve with level D n ^ 1 is possible for the objects 
in Y using Y — > Y p , and for a sequence (/U n ) of measures on Y , provided we have: 

(1) [Existence of independent local distribution] For every d squarefree, the image measures 
fi n ^d converge to some measure v d , and in fact 

Vd = n u p- 

P \d 

In other words, there are probability measures v d on Y& such that 

Vn({y e Y | y (mod p) = a p }) = (j, n (Y)(v d (a) + r djn (a)) 
for all d and a = (oip) p \ d £ Y d , and 

lim r dn (a) = 

n— >+oo 

for all fixed d and a E Y^. 

(2) [Level of distribution condition] Given subsets £l p C Y p , and 

&d = Y\_ ^p> 

P \d 

we have 

(15) V \n d \max\r d>n (a)\ < (logD n y A 

d<D n 

(16) fi n (Y°)<^fi n (Y)(logD n )- A 
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for all n 1, the implied constant depending on A and the SI 



v 



These assumptions are a form of quantitative, uniform equidistribution results for the re- 
ductions of objects of Y modulo primes, and of independence of these reductions modulo 



various primes. Much of the difficulty of the sieve in orbits (as in Section 2.1 or more gener- 



ally, as described in the next section) lies in checking the validity of these conditions. We will 
present some of the techniques and new results in the next sections. 

However, if we assume that the basic requirements are met, we may combine these condi- 
tions with the fundamental statement of sieve, to deduce the following general fact. Given 
sets Q p C Y p such that 

(17) u p (n p ) = ^ + 0{p- l - & ) 

(a condition which is local and does not depend at all on 1") for some 5 > and p prime, we 
obtain by summing over Q p that the congruence sums satisfy 

Hn{y £ Y | y (mod p) £ ft p for p\d) = n n {Y)v d (VL d ) + r d>n {Cl) 

where 

rd,n(Q) < |^rf|max|r rfn (a)|. 

By independence, g(d) = \£l d \ is multiplicative. Hence we derive (weakening the conclusion 
of Theorem 3.2) that 

(18) fi n ({y £ Y \ y (mod p) $ Q p for all p < D l r / S }) x ( ^\ K 

(log L) n ) 

for n ^ 1 and for all fixed s large enough (in terms of k). 



Example 3. 7. — This description applies easily to Examples |3.1| and |3.5| with the sequence 
(nx) of counting measures on 1 ^ m ^ X. The measure v p is simply the uniform probability 
measure on Z/pZ, and the independence (which passed almost unnoticed) is valid, as an 
expression of the Chinese Remainder Theorem: knowing the reduction modulo a prime p\ of 
a general integer n gives no information whatsoever on its reduction modulo another prime 
P2- Quantitatively, we have 

\{m ^ X | m = a (mod d)}\ = — + O(l), 



and we recover the level of distribution D = X 1 5 with 5 > of Example 3.4 (This is about 



the best one can hope for.) Note that condition (16) is here trivial since 1"° is the finite set 
of positive integral roots of /. 



Remark 3.8. — The upper-bound (16) is obtained, in all the cases discussed in this report, 
as an immediate consequence of (15). Indeed, in all cases we consider, it will be true that 
n{y) = means that y (mod p) G f2 p for all p. One can then write 

»n(Y°) ^ n n ({y £Y \ y (mod p) G Q}) 

for any fixed p, and under the assumptions above, we find for p ^ D n that 

Vn(Y°) ^ fi n (Y){v p {n p ) + 0(\n p \ max \r Pjn (a)\)\ 

« Mn (Y){^ + 0((lo gj D n r A )}, 
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so we derive (16) by taking p of size comparable with (logD n ) A . 

4. SIEVE IN ORBITS 

We will now present the general version of the sieve problem developed by Bourgain, Gam- 



burd and Sarnak, which generalizes the example of Apollonian circle packings (Section 2.1). 



As with the group A, the global objects of interest are directly related to discrete groups with 



exponential growth. (Further discussion of the example in Section 2.2 is found in Section 6.1 ) 



4.1. The general setting 

Consider a finitely-generated group A C GL m (Z) for some m ^ 1 (e.g., A C GLi(Z), the 
"Lubotzky" group L of in SL 2 (Z), or SL m (Z) itself). 
Given a non-zero vector xo E Z m , form the A-orbit 

O(x ) = A ■ x C Z m , 

and fix some polynomial function / G Q [X\ , . . . , X m ] such that / is integral valued and 
non-constant on O(xq) (for instance, / £ Z[Xi, . . . , X m ]). The "philosophical" question to 
consider is then 

"To what extent are the values f(x), where x G O(xo), typical integers?" 

More precisely, as far as sieve is concerned, the question is: how do the multiplicative 
properties (number and distribution of prime factors) of the 7(7x0) differ from those of general 



integers'; ( 7 ) From the point of view of Section 3.2, we are therefore looking at Y = 0(xo), 
and its reductions modulo primes 

Y^Y p = O(x (modp)), 

the latter being the orbit of xq (mod p) under the action of A p , the image of A under reduction 
modulo primes. 

Remark 4-1- — One may also consider other actions of A; for instance taking Y = A itself is 
natural enough, with Y p the reduction of A modulo p. However, if one considers the image 
of A in GL m 2(Z) corresponding to the multiplication action of A on GL m (Z) on the left, the 

2 

group A is isomorphic to the orbit of the vector "identity" xo = 1 £ M m (Z) ~ Z m . 

Later we will explain how to count elements of Y in order to apply the basic sieve statements. 
However, there is a first qualitative way of phrasing the guess that there should be many 
elements x of O(xq) with f(x) having few prime factors, which was pointed out in [4j. Namely, 
let 

O/(x ;r) = {xe 0(x ) I n(/(x)) <r} 
for r ^ 1 and define the "saturation number" for the orbit: 

r(f,A) = min{r ^ 1 | Of(xo',r) and O(xo) have the same Zariski-closure} , 

or in other words, the smallest r ^ 1 such that the elements of Of(xo;r) satisfy no further 
polynomial relation than those of the full orbit O(xo). One asks then: 

( - 7 ' For readers unfamiliar with what this means, the Appendix gives some of the most basic statements along 
these lines. 
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Question 4.2. — Is this "saturation number" finite, and if yes, what is it? 

Example 4-3. — When m = 1, a set in Z is either Zariski-closed, if finite, or has Zariski- 
closure the whole affine line. Thus the existence of saturation number for an infinite subset 
of Z amounts to no more (but no less) than the statement that is infinite. 

However, with m ^ 2, the saturation condition may become quite interesting. The following 
example is discussed in [4, §6, Ex. C]: consider the orbit of xq = (3,4,5) under the action 
of the orthogonal group A = SO(2,l)(Z). This orbit is the set of integral Pythagorean 
triples, and its Zariski closure is the cone {x 2 + y 2 — z 2 = 0}. Considering the function 
f(x, y, z) = xy/2 (the area of the right-triangle associated with (x, y, z)), and using the recent 
quantitative results of Green and Tao [25] on the number of arithmetic progressions of length 
4 in the primes, Bourgain, Gamburd and Sarnak show that the saturation number is 6 in that 
case. However, it is highly likely (this follows from some of the Hardy-Littlewood conjectures) 
that there are infinitely many integral right-triangles with area having ^ 5 prime factors! 
These triangles, however, have sides related by a non-trivial polynomial relation. 

It seems interesting to to sharpen a bit the strength of the saturation condition by replacing 
the condition "Zariski-dense" with the stronger condition that Qf(xo; r) be not thin. We recall 
the definition (see [571 §3-1]): 

Definition 4.4 (Thin set). — Let V/k be an irreducible algebraic variety defined over afield 

k of characteristic zero. A subset A C V(k) is thin if there exists a k-morphism W — > V of 
algebraic varieties such that f has no k-rational section, dim(W) ^ dim(V) and 

A C f{W{k)). 

Example 4-5. — For m = 1, there are many infinite thin sets in Z. However, one shows that 
such a set (say T) satisfies 

\{n e T | \n\ sC X}\ < X^ 2 (\ogX) 

for X ^ 2 (a result of S.D. Cohen, see, e.g., [57J Th. 3.4.4]), so that even Chebychev's 
elementary bounds prove that the set of prime numbers is not thin. The example of the set 
of squares shows that the exponent 1/2 is best possible. 



We now see that Theorem 1.1 states, for certain groups A and their orbits, that the satura- 
tion number is finite, even that 0/(xo; r) is non-thin. We will sketch below the proof of this 
result, in a slightly more general case. Interestingly, although [1] only considers the original 
saturation condition, the values of r for which they prove that Of(xo;r) is Zariski-dense are 
always such that it is not thin. 

4.2. How to count? 

In the setting of the previous section, we have many examples of a set Y with reduction 
maps Y — > Y p , and we wish to implement the sieve techniques, for instance to prove that some 
saturation number is finite. For this purpose, as described in Section [3j it is first necessary to 
specify how one wishes to count the elements of Y, or in other words, one must specify which 



finite measures (^ n ) 5 will be used to check the Basic Requirements of sieve (Definition 3.6). 

A beautiful, characteristic, feature of the sieve in expansion is that there are often two or 
three natural ways of counting the elements of Y. We illustrate this in the case that Y = A 
is a (finitely generated) subgroup of GL m (Z). One may then use: 
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- [Archimedean balls] One may fix a norm || • || on GL m (R), and let \xx be the uniform 
counting measure on the finite set 

B A (X) = {g € A C GL m (R) | \\g\\ < X} 

for some X ^ 1. 

- [Combinatorial balls] One may fix instead a finite generating set S of A, assumed to be 
symmetric (i.e., s E S implies s _1 S S), and use it to define first a combinatorial word- length 
metric, i.e., 

\\g\\s = minj/c ^ | g = si ■ ■ ■ Sk for some si, . . . , Sf- € S 1 }. 

Then one can then consider the uniform counting probability measure Hk on the finite 
combinatorial ball 

B s (k) = {g e A | || 5 || s s: fc} 

for ^ 1 integer. This set depends on S, but many robust properties should be (and are) 
independent of the choice of generating sets. 

[Random walks] Instead of the uniform probability on combinatorial balls, it may be 
quite convenient to use a suitable weight on the elements of Bs(k) that takes into account 
the multiplicity of their representation as words of length k. More precisely, we assume that 
1 G S (adding it up if necessary) and we consider the probability measure on A such that 

l*k(9) = rL: 1 
( Sl ,...,s k )eS k 

si--s k =g 

(adding 1 to a generated set S, if needed, ensures that this measure is supported on the 
S'-combinatorial ball or radius k, and not the combinatorial sphere.) 

Compared with the previous combinatorial balls, the point of this weight is that it simplifies 
enormously any sum over Bs(k): we have 

f(9)^k{g) = -ri ' ' ' v( Sl ' ' ' 5fc )> 

g£\ 1 1 si,...,s k eS 

for any function (p on A, where the summation variables on the right are free. 

Remark 4-6- — This third weight has a natural probabilistic interpretation: fi^ is the law of 
the k-th step of the left-invariant random walk (Xk)k^o on A, defined by 

Xo = 1 G A Xk+i = Xkik+i, 

where (Cfc)fc^i is a sequence of S- valued independent random variables (on some probability 
space (f2,£,P)) such that 

P(6c = s) = — for all k^ 1 and s£S. 
Pi 



Once the counting method (fix) is chosen, the more precise form of Question 4.2 becomes 
to bound from below the function 

7r f (X;r) = fix(ie A | n(/( 7 .a:o))<r) 
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as X —7- +00. Precisely, the idea is to prove - for suitable r - a lower bound which is sufficient 
to ensure that Of(xo;r) is Zariski-dense, by comparison with upper bounds (known or to be 
established) for the counting functions 

Hx{l G A I f(-y-x ) G W) 

for a proper Zariski-closed subvariety W C V = 0(xq) (or for a thin subset W C V(Q) H Z m ). 



5. THE BASIC REQUIREMENTS 

Consider A and an orbit 0(xq) as in the previous section, and assume a counting method 
(i.e., a sequence (nx) of measures on Y) has been selected. We now proceed to check if the 
basic requirements of sieve are satisfied. 

However, we first make the following assumption (which will be refined later): 

Assumption 5.1. — The Zariski-closure G/Q of the group A C GL m (Z) is a semisimple 
group, e.g., SL m; or Sp 2g or an orthogonal group, or a product of such groups. 

For instance, this means that we exclude from the considerations below the subgroup gen- 
erated by the single element 2 G GLi(Z[l/2]) and its orbit {2™} C Z[l/2] (we extend here 
the base ring slightly); this is understandable because the question of finding integers n for 
which, say, f(2 n ) = 2 n — 1, is prime remains stubbornly resistant. Indeed, the basic results 
used to understand the local information available from A — > A p simply fail in that case 



(see Remark 5.5 below). In |H §2], Bourgain, Gamburd and Sarnak give some more exam- 
ples that show why general reductive groups lead to very different - and badly understood - 
phenomena. 

The same reason make solvable groups (e.g., upper-triangular matrices) delicate to handle; 
as for nilpotent groups, they are definitely accessible to sieve methods. However, their behavior 
(in terms of growth) is milder and one would not need the considerations of expansion in groups 
that are needed for semisimple groups. 

5.1. Local limit measures 



According to Section 3.2, we start the investigation of sieve by checking whether, for a fixed 



squarefree integer d, the measures jj,x,d on 

P \d 

have a limit, and whether this limit is a product measure. This last condition is crucial, 
and it sometimes require more preliminary footwork. The basic difficulty is illustrated in the 
following situation: suppose that, for some set Z, there are non-constant maps 

N : Y -> Z, N p : Y p ^ Z, 

and that 

N(y)=N p (y (mod p)) 

for all primes p. Then y (mod p) does carry some information concerning y, namely the value 
of N(y). Consequently, it is not possible for y (mod P1P2) to become equidistributed towards 
a product measure u pi x u P2 . 
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Example 5.2. — In the sieve in orbits, this happens frequently when orthogonal groups are 
concerned. For instance, the Apollonian group A is not contained in SO(Q, Z) and we have 

det(7) = det(7 (mod p)) 

for all p. (In fact, there is a further obstruction even for SO(Q, Z).) 

If such obstacles to independence appear for a given problem, this is a sign that it should 
reworded or rearranged: instead of Y, one should attempt to sieve, for instance, the fibers of 
N. (This is justified by the fact that, in some cases, different fibers may indeed have very 
different behavior, and can not be treated uniformly). 

In our case of sieve in orbits for a group A which is Zariski-dense in the semisimple group 
G/Q, the most efficient method is to reduce first to the connected component G° of G (by 
replacing A with AnG°(Q) and then to the simply- connected covering group G sc of G° using 
the projection map 

vr : G sc -»• G° ; 

one works with the inverse image A sc of A in G SC (Q), and consider the function / = / o it 
instead of /. Note that, in general, either of these operations might require to work over a 
base number field distinct from Q, but there is no particular difficulty in doing so. For a 
detailed analysis in the case of the Apollonian group A, where G = 0{Q) is not connected 
and the connected component SO(Q) is not simply-connected, see [U §6] or [17] . 

Example 5.3. — In the situation of Theorem |1.1[ we have G = SL m , which is connected and 
simply-connected, so these preliminaries are not needed. The same thing happens if G is a 
symplectic group G = Sp 29 , or if G is a product of groups of these two types. 

The following result is the crucial ingredient that shows that a Zariski-dense subgroup in a 
simply-connected group satisfies a strong form of independence of reduction modulo primes: 

Theorem 5.4 (Strong approximation and independence). — Let G be a connected, simply- 
connected, absolutely almost simple Q-group embedded in GL m /Q, and let A C G(Q) n 



GL m (Z) be a Zariski-dense subgroup S- 8 ' There exists a finite set of primes E = E(A) such 
that G has a model, still denoted G, over Z[l/E], and: 

(1) For all primes p not in E, the map 

A -> G(F P ) 

is surjective, i.e., the image A p of reduction modulo p is "as large as possible", so that A p = 
G(F P ). 

(2) For all squarefree integers d coprime with E, the reduction map 

A — >• JJg(F p ) = G(Z/dZ) 

P \d 

is surjective, i.e., we have A^ = G(Z/dZ) and A — > is surjective. 



t 8 ' For instance G = SL m or Sp 
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(3) Assume (ik is the weighted counting method of Section 4-2 associated with a finite sym- 
metric generating set S, with 1 G S. Then, for any integer d coprime with E, the probability 
measures fa : d on 

A d = l[A p = G(Z/dZ) 

P \d 

converge, as k —> +oo, to the uniform probability measure Vd = Tl u p> ^- e -> the measure so that 

^ (7) = \G{z/dz)y for aU 7 G G ( z / dz y 

Parts (1) and (2) have been proved, in varying degree of generality (and with very different 
methods), by a number of people, in particular Hrushovski and Pillai [30], Nori |47| . Matthews- 
Vaserstein-Weisfeiler [H] ; the most general statement is due to Weisfeiler [60] . 

Part (3), on the other hand, is usually not proved a priori, It holds, in fact, also in many 
cases when the two other counting methods are used (i.e., archimedean balls and unweighted 
combinatorial balls) but it is then seen as a consequence the stronger quantitative forms of 
equidistribution (which are parts of the Basic Requirements anyway) . We will say more about 
this in the next sections, but we should observe, However, that these limiting measures are 
certainly the most natural ones that one might expect (being the Haar probability measures 
on the finite groups G(Z/dZ)). 

We explain the quite straightforward proof of (3) for the weighted counting (it is also an 
immediate consequence of the probabilistic interpretation and standard Markov-chain theory) . 

Let (p : Arf — > C be any function. The integral of ip according to fad is 

E^)r^E---E 1 = r^E---E^---^) = ( M V)(i) 

yeA d 1 1 Sl ,...,s k eS 1 1 s 1: ...,s k es 

si---s k =y 

where M is the Markov averaging operator on functions on associated to S, i.e., for any 
/ : Aa — > C and x G A^, we have 

The constant function 1 is an eigenfunction of M with eigenvalue 1. Because S generates 
A (hence A^), it is an eigenvalue with multiplicity 1. Thus if we write 

<P = viri E + Vo> 

I d\ 

y&^d 

we have 

M ^ = u^E^) + M ^0' 
1 dl yeA d 

and therefore 

(19) (MV)(1) " ^ E <P(V) < max|(MVo)(y)| < I A d |/9 (M) fc ||^|| 2 

where po is the spectral radius of the operator M restricted to the space of functions on A^ 
with mean (according to the uniform probability measure) , endowed with the corresponding 
L 2 -norm || • [ | ^ - Having ensured that 1 G S, as we required for the weighted counting method, 
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also implies that —1 is no!*- 9 ) an eigenvalue of M. Since M is symmetric, hence has real 
spectrum, and this spectrum is clearly in [—1,1], it follows that po{M) < 1. Thus 

as — > oo, which is the desired local equidistribution with respect to v<i- 



Remark 5.5. — In fact, independence, in the sense of Theorem 5.4, only holds for simply- 
connected groups. Thus, its conclusion may be taken as a "practical" alternate characteriza- 
tion, with a very obvious interpretation, as far as sieve is concerned at least. 

Now, consider again the cyclic group generated by 2 £ GLi(Z[l/2]). Its image in GLi(F p ) 
(for p odd) is cyclic, and of order the order of 2 modulo p. This remains a very mysterious 
quantity; in particular, it is certainly not the case that 2 generates F* for most p (a well- 
known conjecture of Artin states that this should happen for a positive proportion of the 
primes, but even this remains unknown). Thus the basic principles of sieve break down at 
this early stage for this group. (The best that has been done, for the moment, is to use the 
fact that, for almost all primes £, there is some prime p for which the order of 2 modulo p 
is £, in order to show that 2™ — 1 has, typically, roughly as many small prime factors as one 
would expect in view of its size; see [35] Exercise 4.2]). 

Example 5.6. — For the Apollonian group, Fuchs |17] has determined explicitly the image 
under reduction modulo d of (the inverse image in the simply connected covering of) A, for 
all d. 

For concrete groups, it might also be possible to check Strong Approximation directly. 
For the group L C SL/2(Z) (see ([!])) which has infinite index, for instance, it is clear that L 
surjects to SL^Fp) for all primes p ^ 3, and that it is trivial modulo 3. Then one obtains the 
surjectivity of 

J]SL 2 (F P ) 

P \d 

for all d with 3 \ d using the Goursat Lemma of group theory (which is crucial in the proof of 
(2) in any case). 

5.2. Quantitative equidistribution: combinatorial aspects 

We consider a finitely generated group A C GL m (Z) such that its Zariski-closure G is 
simple, connected and simply connected, together with a symmetric finite generating set S. 

If we select the weighted combinatorial count (assuming that 1 6 S), we see from The- 
orem 5.4 that in order to satisfy the basic requirements of Definition |3.6| for sieving up to 



some level D^, we "only" must check the level of distribution condition (15), provided we 
restrict our attention to sieving using primes p outside the possible finite exceptional set X, 
and squarefree integers d coprime with S. 



In turn, the estimate (19), with ip the characteristic function of a point a, shows that we 



have a quantitative equidistribution result for fixed d, (d, £) = 1, of the type 



\r d ,k( a )\ < VWd\Pd, 

(9) Let S' = S - {1}, \S\ = 1; the operator M can be written (1 - l/s)M' + l/s, M' being the averaging 
operator for the generators S'; since the spectrum of M' lies in [—1, 1], that of M must be in [—1 + 2/s, 1]. 
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valid for all a £ Ad, where pd is the spectral radius of the averaging operator on functions of 
mean zero on A^, which satisfies pd < 1. 

It is therefore obvious that obtaining a level of distribution for A amounts to having an 
upper bound for pd which is uniform in d. 

The best that can be hoped for is that pd ^ p < 1 for all d 1 and some fixed p: the 
exponential rate of equidistribution is then uniform over d. This is equivalent to the well- 
known, incredibly useful, condition that the family of Cayley graphs of with respect to S 
be an expander (see [28J for background on expanders). Precisely: 

Definition 5.7 (Expander family, random- walk definition). — Let (Ti)i^j be a family of 
connected k-regular graphs for a fixed k ^ 1, possibly with multiple edges or loops. The family 
(Ti) is an expander if there exists p < 1, independent of i, such that 

Pi ^ P < 1 

for all i, where pi is the spectral radius of the Markov averaging operator on functions of mean 
onTi, with respect to the inner product 

(/i,M = ^/i(« 

Assuming that we have an expander, with expansion constant p, and that 

\n p \^ P A , \a p \^ p a \ 

we obtain the immediate estimate 

J2\n d \nwx\r d>k (a)\^Di+ A + A ^p k 



for k ^ 1. This provides levels of distribution (as in Definition 3.6) of the type 

(20) D k =p k , for any 1< (3 < p-Vd+A+A^). 

This applied to the weighted counting. One may however expect that, under the condition 
that the Cayley graphs be expanders, a similar level of distribution should hold for combina- 
torial balls also. However, to the author's knowledge, this is currently only known under the 
additional condition that A be a (necessarily non-abelian) free group. When this is the case, 
Bourgain, Gamburd and Sarnak (3J §3.3, (3.29), (3.32)] prove, using the well-known spectral 
theory on free groups, that for the counting measure on the combinatorial ball of radius k. 
there exists r < 1, depending only on the expansion constant p of the family of graphs, such 
that the measures fld,k converge towards the uniform measure Vd on A^ (for d coprime to a 
suitable finite set of primes), with error bounded by 

\r d ,k{a)\ « \B s (k)r l 

for all k ^ 1. 

The restriction to free groups is awkward from some points of view. However, for instance, 
the inverse image of A in the simply-connected covering of SO(Q) is free, so this can be used in 
the case of the Apollonian circle packings. Moreover, as observed in [2], for applications such 
as upper-bounds on saturation numbers, one can use the fact (the "Tits alternative") that, 
under our assumptions on G, any Zariski-dense subgroup A of G contains a free subgroup Az 
which is still Zariski-dense in G. It is then possible to apply sieve to the orbit of xq under Az 
instead of A. 
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It becomes, in any case, of pressing concern to know whether the expansion property holds. 
Here we can reformulate the question in terms of the Cayley graphs of Gr(Z/dZ) for d #s 1 
squarefree, with respect to suitable sets of generators (avoiding a few exceptional primes). 

This question has in fact quite a pedigree. But until very recently, the known examples 



where all lattices in semisimple groups ( 10 * > Indeed, the expansion property can also be phrased 
in the case of interest as stating that the group A has Property (r) of Lubotzky for represen- 
tations factoring through the congruence quotients A — > = G(Z/dZ). Thus it is known, 
by work of Clozel |llj . for A any finite-index subgroup of G(Z). In fact, for many cases of 
great interest, like (finite index subgroups in) SL m (Z) for m ^ 3 or Sp 29 (Z) for g ^ 2, the 
result follows from Kazhdan's Property (T). 

For subgroups A (possibly) of infinite index in G(Z), there has been dramatic progress 



recently partly motivated by the sieve applications. The following theorem has now been 



announced: 



Theorem 5.8 (Expansion in finite linear groups). — Let G/Q be absolutely almost simple, 
connected and simply-connected, embedded in GL m for some m, e.g., G = SL m , m ^ 2, or 
Sp2 9 , 5^1- Let A C G(Q) n GL m (Z) be a finitely- generated subgroup which is Zariski-dense 
in G. Fix a symmetric finite system of generators S of A. Then the family of Cayley graphs, 
with respect to S, of the groups A^ obtained by reduction modulo d of A is an expander family, 
where d runs over squarefree integers. 



This applies, in particular, to the group L of Q; this was essentially a question of Lubotzky. 

Many people have contributed (and still contribute) to the proof of this result (and variants, 
extensions, etc). We do not attempt a complete history or any semblance of proof, but it seems 
useful to sketch the overall strategy that has emerged: 

- [1st step: Growth] A first crucial step, which was first successfully taken by Helfgott [26] 
for G = SL2 and SL3 [27], is to prove a growth theorem in the finite groups G(F P ) for p prime: 
there exists 5 > 0, depending only on G, such that for any generating subset A C G(F P ), we 
have 

(21) \A-A-A\ = \{abc \ a,b,c£ A}\ > min(|G(F p )|, |^| 1+5 ), 

where the implied constant depends only on G. 

Such a result, once known, implies that the diameter of the Cayley graphs is <C (logp) c 
for some constant C ^ 1. This, by itself, suffices - by standard graph theory - to obtain 
an explicit upper-bound for the spectral radius p p , but one which is weaker than the desired 
uniform spectral gap (namely, of the type 1 — p p S> (logp)~ D for some D ^ 0). Although this 
is insufficient for applications to results like Theorem |1.1[ it may be pointed out that this is 
enough for some others, including rather surprising ones in arithmetic geometry |14j . 



'- 10 '' Except for some isolated examples of Shalom 58, Th. 5.2] and, rather implicitly, the examples arising from 
Gamburd's work [TS] on the "spectral" side, detailed in the next section. (Both applied only to d = p prime.) 

A story which is well worth its own account; the excellent survey [24] by B. Green, despite being very 
recent, does not cover many of the most remarkable new results. 
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After Helfgott's breakthrough, growth results were proved by Gill and Helfgott [20J (for 
SL m , with a restriction on A) and (independently and simultaneously) by Breuillard-Green- 
Tao [8J and Pyber-Szabo |51|, Th. 4] in (more than) the necessary generality for our pur- 



pose 



(12) 



An intermediate paper of Hrushovski [29] should be mentioned, since it brought 
to light a somewhat old preprint of Larsen and Pink |36j . from which a useful general in- 
equality emerged (see, e.g, [U Th. 4.1]) concerning (roughly) the size of the intersection of a 
"non-growing" set of G(F p ) and proper algebraic subvarieties of G. 

- [2nd step: Expansion for primes] As mentioned, the growth theorem does not immediately 
imply that the Cayley graphs are expanders. Bourgain and Gamburd [3] were the first to prove 
this property for SL2(F p ). Their method starts with an approach going back to Sarnak and 
Xue [56], which compares upper and lower bounds for the number of loops in the Cayley 
graphs, based at the identity, and of length t « log p. As in [56], the lower bound comes from 
a spectral expansion and the fact that the smallest degree of a non-trivial linear representation 
of SL^Fp) is "large" (namely, it is (jp — l)/2, as proved by Frobenius already). The upper- 
bound relies on a new important and ingenious ingredient, now called "flattening lemma" (0 
Prop. 2]), which is used to show that large girth of the Cayley graphs (a property which is 
fairly easy to prove) is enough to ensure that, after 3> \ogp steps, the random walks on the 
graphs are very close to uniformly distributed, and thus there can't be too many loops of that 
length at the identity. In turn, the proof of the flattening lemma turns out, ultimately, to be 



obtained from Helfgott's growth result (21) in SL2(F p ). (Why is that so? Very roughly, one 
may say that Bourgain and Gamburd show that, if doubling the number of steps 3> \ogp of 
the random walk does not lead to a great improvement of its uniformity, it must be the case 
that its support must be to a large extent concentrated on a set A C SL^Fp) which does 



not grow, i.e., such that (21) is false; according to Helfgott's theorem, this means that A is 
contained in a proper subgroup, but such a possibility is in fact fairly easy to exclude, because 
one started with a random walk using generators of G(F P )). 

After the proof of the general growth theorems, this second step was extended to other 
groups (e.g., it is announced by Breuillard, Green and Tao in [8]). 

- [3rd step: Expansion for squarefree d] This step, which the discussion above has shown to 
be absolutely fundamental for sieve applications, was first done by Bourgain, Gamburd and 
Sarnak for SL2 in [3], using a rather sophisticated argument. However, Varjii [59 J found a 
more streamlined proof, which can be adapted to more general groups, in particular SL m , as 



soon as a growth theorem for G(F p ) is known ( 13 ) More general cases, including the statement 



we have given, have been announced by Salehi Golsefidy and Varjii [53] (who give the most 
general, and in fact, best possible version, which applies to any group such that the connected 
component of identity of the Zariski-closure is perfect) . 

5.3. Quantitative equidistribution: spectral and ergodic aspects 

We now consider a sieve in orbit, for a subgroup A with Zariski-closure G/Q, where we 
count using counting measures on archimedean balls. The results are more fragmentary than 
in the combinatorial case. Certainly, from the discussion above, we see that the main issue 



( 12 ) This increased generality may be very useful for other applications, e.g., the Pyber-Szabo version is quite 
crucial in |14j . 

'- 13 - 1 Note that Bourgain and Varjii 7 also prove the expansion property for SL m (Z/dZ) for all d ^ 1, not only 
those which are squarefree. 
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is to extend the quantitative equidistribution statement (Part (3) of Theorem 5.4) to this 
counting method. However, Parts (1) and (2) still apply. Since 



Hx{l G A | 7 = 70 (mod d)) = ^ 1 

INK* 

7=70 (mod d) 

for any d ^ 1 and 70 £ A^, and this is also 

E 1 

||T7o||<X 
rGA(d) 

where A(d) = ker(A — > Ad) is the c?-th (generalized) congruence subgroup of A, one can see 
that this amounts to issues of uniformity and effectivity in "lattice-point counting" for the 
quotient X\ = A\G(R) and the congruence covers X\(d) = A(d)\G(R). 

For G(R) = SL2(R), A C SL2(Z), and X\ of finite volume, a well-known result of Selberg 
(see, e.g., [311 Th. 15.11]), the original proof of which depends on the spectral decomposition of 
the Laplace operator on X\, proves the local equidistribution with good error term depending 
directly on the first non-zero eigenvalue X\(d) for the Laplace operator on the hyperbolic 
surface A(d)\H. This indicates once more that spectral gaps - of some kind - are crucial tools 
for the quantitative equidistribution. The striking difference with the elementary argument 



leading to ( 10 ) should become clear: instead of counting integers in a (large) interval, where 
the boundary contribution is essentially negligible, we have hyperbolic lattice-point problems, 
where the "boundary" may contribute a positive propertion of the mass. 

It is now natural to distinguish two cases, depending on whether A is a lattice in the real 
points of its Zariski-closure G (always assumed to be simple, connected and simply-connected), 
or whether A has infinite index in such lattices; in terms of X\, the dichotomy has very clear 
meaning: either X\ has finite or infinite volume, with respect to the measure induced from 



a Haar measure on G(R). (Note that we still require Theorem 5.4 to be valid, which means 
that G(R) has no compact factor). 

(1) [Finite- volume case] Although it seems natural to apply methods of harmonic analysis 
on X\(d), similar to Selberg's, there are serious technical difficulties. This is especially true 
when Xa is not compact, since the full spectral decomposition of L 2 (X\) depends then on 
the general theory of Eisenstein series (see the paper of Duke, Rudnick and Sarnak [12] for 
the first results along these lines). 

However, starting with Eskin-McMullen [15], a number of methods from ergodic theory 
have been found to lead to very general results on lattice-point counting in this finite-volume 
case. For the purpose of showing the required quantitative uniform equidistribution (as in 



Theorem 5.4), one may mention first the results of Maucourant [45]; the most general ones 
have been extensively developed by Gorodnik and Nevo [21], |22j (see also [IS]). Without 
saying more (due to a lack of competence) , it should maybe only be said that the incarnation 
of the spectral gap that occurs in this case is the exponent p = p\ > 2 such that matrix 
coefficients of unitary representations occurring in Lq(A(c£)\G(R)) are in LP +£ for all e > 0. 
The existence of such a p > 2 is known from the validity of Property (r). If G(R) has 
Property (T), this constant depends only on G(R), and explicit values are known (due to 
Li [38] for classical groups and Oh in general |48j): for certain groups like SL m , m ^ 3 or 
Sp2 9 ) 5^2, these works give optimal values, as far as the general representation theory of the 
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group G(R) is concerned (the actual truth for congruence subgroups lies within the realm of 
the Generalized Ramanujan Conjectures, and is deeper; see [S3] for more about these aspects.) 



(2) [Infinite volume case ( 14 ) The archimedean counting for these groups is the most delicate 
among the cases currently considered. Indeed, the only examples which have been handled in 
that case are subgroups of the isometry groups of hyperbolic spaces, i.e., of orthogonal groups 
0(n, 1), where the Lax-Phillips spectral approach to lattice-point counting [37] is available, at 
least when the Hausdorff dimension of the limit set of the discrete subgroup A C SO(n, 1)(R) 



is large enoug hF^ This is the case, for instance, for the Apollonian group A (which can 
be conjugated into a subgroup of 0(3, 1)(R)): the limit set has Hausdorff dimension > 1.30, 
whereas the Lax-Phillips lower-bound is 5 > 1. 

Again, due to a lack of competence, no more will be said about the techniques involved, 
except to mention that the presence of a spectral gap for the hyperbolic Laplace operator still 
plays a crucial role; such gaps are established either by methods going back to Gamburd's 
thesis [19], or by extending to infinite volume the comparison theorems between the first 
non-zero eigenvalues for the hyperbolic and combinatorial Laplace operators (due to Brooks 
and Burger in the compact case, see, e.g., (9j Ch. 6]), and applying the corresponding case of 



expansion for Cayley graphs (Theorem 5.8 ) . We will however state a few results of Kontorovich 



and Oh [33J in Section 5.5 and refer to Oh's ICM report |49J for more on the methods involved. 



5.4. Finiteness of saturation number 

We now show how to implement the sieve to prove Theorem |l.l[ using the weighted counting 
method (i.e., implicitly, random walks). It should be quite clear that the method is very 
general. 

Let Y = A, xq and / be as in the theorem, or indeed Zariski-dense in a simple simply- 
connected group G (instead of SL m ). For simplicity, we consider the sieve in Y = A instead 
of the orbit 0(xq); it is straightforward to deduce saturation for the latter from this. We also 
assume that the irreducible components of the hypersurface {/(7X0) = 0} in G are absolutely 



irreducible ( - 16 - ) Fix a symmetric set of generators S with 1 € S. By our previous arguments 



(Theorem 5.4 and Theorem 5.8), the basic requirements of sieve are met. 



We proceed to study the sifted set (12) for the set of primes 7 consisting of those p not in 



the finite "exceptional" set £ given by Theorem |5.4[ and 

% = {7 <E Y p = G(F p ) I /( 7 • (x (mod p))) = G Z/pZ}. 

Indeed, S 2 (A; fi) is the set of 7 G A for which /(7 • xo) has no prime factor < z (outside £). 
After maybe enlarging the set £ (remaining finite), standard Lang- Weil estimates show that 



\Qp\ = K P Aim{G) - 1 + 0(p di 



dim(G)-3/2^ 



'- 14 - 1 Bourgain, Gamburd and Sarnak say that this is the case of "thin" subgroup A, which is appealing termi- 
nology, but - unfortunately - clashes with the meaning of "thin" in Definition [4^4]- no Zariski-dense subgroup 
A C GL m (Z) of G is "thin" in G(Q). 

'- 15 ' 1 Very recent work of Bourgain, Gamburd and Sarnak [5] has started approaching the problem for subgroups 
of SL2 (R) with limit sets of any positive dimension. 

( - 16 ' ) As noted in [4l p. 562], in the simply-connected case, the ring Q[G] of functions on G is factorial; the 
assumption is then that the irreducible factors of / are still irreducible in Q[G]. 
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where k is the number of absolutely irreducible components of the hypersurface in G defined 
by {f{l x o) = 0}, and the implied constant is absolute. Since 

\G(F P )\ = p dim(G) + 0(/ im ( G )- 1 /2) ) 
(which can be checked very elementarily for many groups) it follows that the density of Q p 



satisfies 



Up (Q p ) = ^ + 0(p- 3 / 2 ) 



for all p ^ S. This verifies (17): the sieve in orbit has "dimension" k in the standard sieve 
terminology. We see from (20 ( 17 * > that (18) holds with 

D k = f3 k 

for some /3 > 1 (indeed /3 can be any real number < V( 1 + 3dmi ( G 9/ 2 ) ; where p < 1 is the 
expansion constant for our Cayley graphs, as in Definition 5.7). 

The conclusion is that there are many 7 6 A where 7(7 • xq) is not divisible by primes 
< z = f3 k / s , indeed the /ifc-measure of this set, say is 

1 1 
(log z) K k K 

for k large enough. To prove from this that the saturation number is finite, we need two more 
easy ingredients: 

(1) If 7 £ Sfc ; then the integer n = 7(7 -xo) has a bounded number of prime factors. (Except 
if n = 0, which only happens with much smaller probability, see Remark 3.8). Indeed, we 
have 

n = fi s l ■ ■ ■ SnXo) 

for some s« G S. Since the function / has polynomial growth, we see immediately that there 
exists a constant A ^ 1 such that 

(22) /( 7 • x Q ) « A fe 

for all 7 £ Sfc. An integer of this size, with no prime factor < /3 fc//s , must necessarily satisfy 

Remark 5.9. — If the Cayley graphs satisfy a weaker property than expansion, one can still 
do a certain amount of sieving. However, the level of distribution being weaker, one obtains 
only points of the orbit with fewer prime factors than typically expected for integers of that 
size (see the Appendix for the meaning of this) . 

(2) We must check that the lower bound is incompatible with the set of 7 with £l(f(j-x)) ^ r 
being too small, i.e., thin or simply not Zariski-dense. For the latter this is quite easy: indeed, 
any subset W of A contained in a proper hypersurface {g = 0} of G satisfies the much slower 
growth 

H k {W) « 5- k 

for some 8 > 1, as one can see simply by selecting a suitable prime p for which {g = 0} is a 
hypersurface modulo p and bounding 

fJ>k(W) ^ I 9(7) = (mod p)) 



' 17 ' Remark (3.81 is applicable here to check (16 1. 
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using local equidistribution modulo p and the Lang- Weil estimates. 

In order to show a similar result for thin sets, however, one must apply the large sieve 



instead, as discussed briefly in Section 6.2 



5.5. Other results for the sieve in orbits 

We collect here a few results which have been proved in the setting of the sieve in orbits. 

Example 5.10. — We start with results concerning the Apollonian group and the associated 
circle packings. 

— Fuchs [17j has studied very carefully the reductions modulo integers of the Apollonian 
group. 

— Based on this study, a delicate conjecture predicts a local-global principle for the presence 
of integers among the curvature set 6(c); Bourgain and Fuchs [2] have at least shown 
that the number of integers ^ T arising as curvatures (without multiplicity) is 3> T. 

— Kontorovich and Oh have applied spectral-ergodic counting methods in infinite volume 



to deduce, first, asymptotic formulas for the number of curvatures T,( 18 ) and then 
- by means of sieve - have obtained upper and lower bounds for the number of prime 
curvatures, or the number of pairs of prime curvatures of two tangent circles in the 
packing (such as 11 and 23 in Figure 1). Note that here, counting in the orbit means 
that the Lax-Phillips theory does not apply, and thus new ideas are needed. They also 
did a similar analysis for orbits of infinite-index subgroups A of SO(2, 1)(Z) acting on 



the cone of Pythagorean triples (see Remark 4.3), see [33]; remarkably, using Gamburd's 



explicit spectral gap [19], they obtain for instance - for sufficiently large limit sets, but 
possiby infinite index - the expected proportion of triangles with hypothenuse having 
^ 14 prime factors. 

Example 5.11. — The integral points V m>n of the SL m -homogeneous spaces 

V m ,n = {7 6 GL m I det(7) = n) 

have been studied in great detail by Nevo and Sarnak [46J, in the setting of archimedean 
balls, using methods based on mixing and ergodic theory. They show, for instance, that 
if / £ Q[^m,n] is integral valued on V m>n , absolutely irreducible and has no congruence 
obstruction to being prime (i.e., for any prime p, there exists 7 G V m , n with p \ 7(7)), then 
the saturation number of V m ^ n is 

< 1 + I8m 3 e deg(f), 
where m e is the smallest even integer m — 1, in fact that 

(23) |{ 7 € V m , n I || 7 || < T and H(/( 7 )) < r}\ » LLl^^M<lH j 

for r > 18mgdeg(/). 

< - 18 - 1 Counted with multiplicity; the latter, on average, is quite large: about J 1 " 5-1 where S > 1.3 is the dimension 
of the limit set. 
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Example 5.12. — As explained in |46j . bounds like (23) do not transfer trivially to non- 
principal homogeneous spaces (i.e., orbits of an arithmetic group with non-trivial stabilizer), 
although this is no problem when the mere finiteness of a saturation number is expected. 
Gorodnik and Nevo |214 [22] have obtained results which extend such results to many cases. 
Their results apply, for example, to the orbits 

0(so) ={ff£ M m (Z) | g = * 7 5o7 for some 7 G SL m (Z)} 

for a fixed non-degenerate symmetric integral matrix go, if m #s 3. Thus, for suitable functions 
/, k and (explicit) r, they prove a lower bound 

\{ 9 e Ota) I |M < T and IK/ W ) < r>| » ife 6 ^ L »f ' < r " 

(logT) K 

(here the stabilizer is an orthogonal group). 



6. RELATED SIEVE PROBLEMS AND RESULTS 

We present here other developments of sieve in expansion, as well as some analogues over 
finite fields. 

6.1. Geometric examples 



In the spirit of Section 2.2 there are a number of geometric situations where one naturally 
wonders about "genericity" properties of elements in interesting discrete groups not given 
as subgroups of some GL m (Z). Sometimes, using arithmetic quotients, and their reductions 
modulo primes, is enough to attack very interesting problems, as we described already for the 
homology of Dunfield-Thurston 3-manifolds. 

For these, the discrete group involved is the mapping class group T g of a surface T, g of genus 
g. It is finitely generated, and because rather little is known about the precise structure of 
combinatorial balls, it is natural (as done in [13J ) to use a weighted combinatorial counting to 
apply sieve in that case, or in other words, to use a random walk on T g based on a symmetric 
generating set S, with 1 G S for simplicity. 

Since (|2j) and (|3j) only depend on the image of a mapping class <p in Sp 2fl (Z) or Sp 2s (F p ), 
one is - in effect - doing a random walk (though not always with uniformly probable steps) 
on the discrete group Sp 2fl (Z). Since, for g 2 (which is most interesting) this group has 
Property (T), the basic requirements of sieve hold. 

In addition, it is not difficult to compute the size of 

n p = { 7 G Sp 2ff (F p ) | <J p , 7 J p )^F^}, 

which is of size + 0(p~ 2 ) for p ^ 2 (for fixed g; intuitively a determinant must be zero 
for this to hold, and this happens with probability roughly 1/p). Thus the homology of 
Dunfield-Thurston 3-manifolds can be handled with a sieve of dimension 1. 

If 4>k is the k-th step of a random walk on T g (with respect to a generating set), using 



notation from Section 3.2, the set Y corresponds to those manifolds with Hi(M ( j )k , Z) which 



is infinite. As in Remark 3.8, this event has probability to (proved in [13]) exponentially 



fast as k -» +oo ([35, Pr. 7.19 (1)]). 
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One can then also deduce that 

P(Hi(M e / )k ,Z) has no p-part for p < z = j3 k ) X 

K 

for some (3 = (3(g) > 1. Using the description ([2]), we see also that if H\(M^ k ,7i) is finite, 
its order can not be too large, more precisely there exists A ^ 1 such that the product A/% of 
those p with Hi(M^ k ,F p ) ^ satisfies 

(because divides a non-zero determinant of such size). So by comparison, we deduce that 
there exists r (depending on g and the generators S) such that 

P(Hi(M ( j >k , Z) is finite and has order divisible by ^ r primes) » — . 

In another direction, an application of the large sieve shows that, with probability going 
to 1, \Hi(M ( j, k , Z)| is divisible by "many" primes < z. This means \H\{M^, k , Z)| is typically 
finite, but very large (see [351 P r - 7.19 (2)]). 

Other examples of groups where sieve can be applied are given by automorphisms of free 
groups (of rank m #s 2, where the action on the abelianization gives a quotient SL m (Z)). 
Thus, sieve methods give another illustration of the many analogies between these discrete 
groups (others are surveyed in the recent talk [50] of F. Paulin in this seminar); see also the 
related works of Rivin E21 and Maher 



6.2. Large sieve problems 

We have briefly mentioned the large sieve already, and we will now add a few words (see [35] 



for much more on this topic). In the context of Section 3.2, and starting with some work of 
Linnik, many sieve situations have appeared where the condition sets f2 p satisfy 

(24) z/ p (fi p ) ^5>0 

for some 5 > and all primes p; because, for the classical case, this amounts to excluding 
many residue classes, it is customary to speak of a large sieve situation. 

The large sieve method, under the assumptions of Basic Requirements (local independent 
equidistribution and its quantitative version) leads roughly to two types of statements: 

(1) An upper-bound for fj, n (S z (Y; U)) of the type 



(19) 



p n (S z (Y; n)) « »n(Y)H-\ H = Y,^ d ? II i 



(for z of size similar to the level of distribution; see |35[ Prop. 2.3, Cor. 2.13]). In the sieve 
in orbits, this can be used to show that 

fin(W) « 5- k 

for some 5 > 1, where W C G(Q) n GL m (Z) is a thin set, using the fact (see [571 Th. 3.6.2]) 
that the complement f2 p of W (mod p) satisfies a large-sieve condition: 

|O p | > 1 

for p large enough. This extends the finiteness of saturation numbers to thin sets. 
Where it is not necessary to assume, a priori, that (24 1 holds. 



(19) 
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(2) An upper-bound for the mean-square of 

( E i-E^: 

p<z p<z 
y (mod p)Sf2 p 

with respect to \i n (see [35] Prop. 2.15]). This leads to the fact that the number of p < z 
such that y (mod p) is in Q, p is close to the expected value 

E^(^) 

p<z 

with high probability. 

This is used for instance to show that the homology of the 3-manifolds has typically a very 
large torsion part (growing faster than any polynomial, as k — > +oo). 

Another application of the large sieve, in settings related to discrete groups, concernes the 
question of trying to detect the "typical" Galois group of the splitting field of the characteristic 
polynomial of an element x in a subgroup A C GL m (Z). The (quite classical) idea is to use 
Frobenius automorphisms at primes to produce conjugacy classes in the Galois group. If, for 
instance, A is Zariski-dense in SL m , the factorization pattern of the characteristic polynomial 
modulo p gives a conjugacy class c in the symmetric group & m , which is the maximal possible 
Galois group for the characteristic polynomial. Since it is not too difficult to show that 

\{g G SL m (F p ) | the conjugacy class associated to g is c}| 



SL m (F p 



for fixed m, conjugacy class c and p — > +oo, this is a condition like (24) when v v is the uniform 
probability measure on SL m (F p ). One can then prove that the Galois group is as large as 
possible, with probability going to 1 (see [52], [35], Th. 7.12] and, for a very general statement, 
the recent work of the author with F. Jouve and D. Zywina [32], where the typical Galois 
group is essentially the Weyl group of the Zariski closure of A.) 

Finally, very recently, Lubotzky and Meiri [JT] have used the large sieve (and the expansion 
result of Salehi Golsefidy and Varju [53]) in order to prove that, if T is a finitely-generated 
subgroup of GL m (C) which is not virtually solvable, one has 

P(Afe is of the form j m for some 7 G T and m ^ 2) <C exp(— /3 k) 

for every left-invariant random walk (X^) on T defined using a symmetric generated set S 
of T (with 1 G r), where (3 > depends on S. This result does not have any obvious 
"classical analogue" , and is a strong form of a converse of a result of Mal'cev. Moreover, the 
proof involves many subtle group-theoretic ingredients in addition to the sieve, and hence this 
theorem seems to be an excellent illustration of the potential usefulness of sieve ideas as a 
new tool in the study of discrete groups. 

6.3. Sieve for Frobenius over finite fields 

There are a number of interesting analogies between the type of sieve problems in Section [4] 
and problems of arithmetic geometry over finite fields which concern the properties of the 
action (typically, characteristic polynomials) of Frobenius elements associated to families of 
algebraic varieties over finite fields. In this context, instead of expansion properties, one uses 
the Riemann Hypothesis over finite fields (and uniform estimates for Betti numbers) to prove 
the required equidistribution properties (quantitative uniform versions of the Chebotarev 
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density theorem). We refer to |35l §8, App. A] for precise descriptions and sample problems, 
especially in large-sieve situations (which were already implicit in work of Chavdarov |10j). 
and only mention that a prototypical question is the following: given / £ F P [A] of degree 2g 
and without repeated roots, and the family of hyperelliptic curves given by 

C t : y 2 = f(x)(x-t) 

where t is the parameter, how many t £ F p » are there such that |Ct(F p ^)| is prime, or almost 
prime? 

One may also consider the case of a fixed algebraic variety over a number field, and the 
variation with p of its reductions modulo primes. The principles of the "sieve for Frobenius" 
(now, in some sense, in horizontal context) are still applicable, though they suffer from the 
lack of Riemann Hypothesis (or even strong enough versions of the Bombieri- Vinogradov 
Theorem), and the unconditional results are therefore fairly weak (see |10j and |61j). 



7. REMARKS, PROBLEMS AND CONJECTURES 

We conclude by describing some open interesting problems and other related works. 

(1) [Conjugacy classes] Let A C SL m (Z) be a Zariski-dense subgroup, of infinite index. 
The theory and results described previously give information - theorems or conjectures - 
concerning the distribution of the elements of A and, in some sense, their density among the 
elements of SL m (Z). Now one may ask: what about the set of conjugacy classes of A? This 
seems like a very natural and interesting question, and even in the case of m = 2 it does not 
seem (to the author's knowledge) that much is known. 

(2) [Strong equidistribution for word-length metric] It would be of great interest to obtain 



a version of Part (3) of Theorem 5.4 for the (unweighted) word-length counting method when 
A is a fairly general group with exponential growth (in particular, when it is not free). 

(3) [Explicit bounds] We have concentrated on general results, which in some sense are 
quite basic from the point of view of applying sieve. It is natural that now much effort 
goes into improving the results, and in particular in obtaining explicit bounds for saturation 



numbers, ( 20 ) or explicit quantitative lower-bounds. We have mentioned examples like those 



of Nevo-Sarnak or Gorodnik-Nevo for lattices and archimedean balls. These, as well as the 



argument in Section 5.4, indicate clearly that a first inevitable step is to prove an explicit 
version of spectral gap. In the ergodic setting, this comes ultimately from spectral theory, 
and the gap is quite explicit (this goes back to Selberg's famous 3/16 theorem). It would be 
extremely interesting to have, for instance, a version of Theorem |5.8| (even, to begin with, 
for SL2) in which the expansion constant for the Cayley graph is a known function of, say, 
the coordinates of the matrices in the generating set S. As pointed out by E. Breuillard, the 
issue is not the effectiveness of the methods (for instance, there is no issue comparable to 
the Landau-Siegel for zeros Dirichlet L-functions) : the methods and results that lead to this 
theorem are effective in principle (but one must be careful when general groups are involved 
and "effective" algebraic geometry is needed). 

( 20 ) Where "explicit" means having a concrete number, be it 10, 100 or 1000 for a concrete case like, for 
instance, the group L and the polynomial f(-y) = product of the coordinates. 
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(4) [Refinements] Once - or when - an explicit spectral gap is known, one can envision the 
application of more refined versions of sieve; this has been done, e.g., by Liu and Sarnak [39J 
for sieving integral points on quadrics in three variables, where a sophisticated weighted sieve 
is brought to bear. 

Along these lines, it would be extremely interesting also to find examples where Iwaniec's 
Bilinear Form of the remainder term for the linear sieve (see §12.7]) was exploited. Sim- 
ilarly, it would be remarkable to have applications where the level of distribution is obtained 
by a non-trivial average estimate of the remainders r^, instead of summing individual esimates 
(this being the heart of the Bombieri- Vinogradov theorem). 

(5) [Primes?] In many cases, when there are no congruence obstructions, one expects that 



the saturation number be 1, i.e., that many elements of an orbit have f(x) be prime ( 21 ) 
For instance, Bourgain, Gamburd and Sarnak propose HJ Conjecture 1.4] a fairly general 
conjecture concerning the value of the saturation number. The paper of Fuchs and Sanden \18\ 
Conj. 1.2, 1.3] gives two very precise quantitative conjectures concerning prime curvatures 
of Apollonian circle packings, which are quite delicate (and shows that making quantitative 
conjectures is rather subtle in such settings). Some results with primes are known, but the 
methods used are more directly comparable with those of Vinogradov and the circle method 



than with sieve: Nevo and Sarnak [46} Th. 1.4] find a Zariski-dense subset of V m , n (see (5.11 )) 
where all coordinates of the matrix are primes (up to sign), under the necessary condition 
that n = (mod 2 m_1 ), and Bourgain and Kontorovich [6j show that (for instance) the set 
of all integers arising as (absolute value of) the bottom-right corner of an element in a thin 
subgroup of SL2(Z) with sufficiently large limit set contains all positive integers ^ N with 
<C iV 1- " 5 exceptions - in particular, infinitely many primes - for N large enough. 

One may then also ask: "what is the strength of such statements, if valid" ? What do they 
mean about prime numbers? The only clue in that direction - to the author's knowledge - is 
the following indirect fact: Friedlander and Iwaniec have shown (see [16, §14.7]) that one can 
prove that there is the expected proportion of matrices 



a b 
c d 



e SL 2 (Z) 



with 



a 2 + b 2 + c 2 + d 2 = p prime, p ^ X, 



provided a suitable form of the Elliott-Halberstam conjecture holds (level of distribution Q = 
XV 2 + 5 , for some small 5 > 0, for primes ^ X). This is a type of sieve in orbits, obviously. 
The assumption here is widely believed to be true, but seems entirely out of reach (e.g., it is 
much stronger than the Generalized Riemann Hypothesis). It is now known (by the work of 
Goldston, Pintz and Yildirim) that this would also imply the existence of infinitely gaps of 
bounded size between consecutive primes (see, e.g., [161 Th. 7.17]). 



' 21 ) More precisely, prime or opposite of a prime; for rather fundamental reason, explained in [3J §2.3], one 
cannot hope to distinguish between these two. 
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APPENDIX: WHAT TO EXPECT FROM INTEGERS 

We recall here, very briefly and for completeness, the most basic estimates concerning 
multiplicatives properties of integers. These serve as comparison points for statements on the 
distribution of prime factors of elements of any set of integers. Of course, all these facts are 
known in much stronger form than what we state. 

— The number of primes p ^ X is asymptotic to X/(logX) (the Prime Number Theorem). 

— More generally, for k 1 (fixed), the number of integers n $J X which are product of k 
(or ^ k) prime factors, is asymptotic to 

1 A^loglogA)^ 1 
(fc^l)! (logX) ~- 

— On the other hand, for k ^ 1 (fixed), the number of integers n ^ X which have no 
prime factor p ^ X 1 ^ is of order x X/(logX). Note that this set is a subset of the 
previous one (when ^ k prime factors are considered), but the restriction on the size of 
prime factors is more stringent than the restriction on their numbers, and the order of 
magnitude becomes insensitive to the number of prime factors k. 

— The typical number of prime divisors of an integer n ^ X is log log X; indeed, we have 
the Hardy-Ramanujan variance bound 

fan) - log log X) 2 < X log log X, 

so that, e.g., there are 

X 

log log X 

integers ^ X with \Q(n) — log log X\ ^ (loglogX)/2. 
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